## HOW TO RUN THIS CODE
This code consists of a single Python script, `jca.py`, that produces plots related to Jo Guldi's research on land and property in the Parliamentary Debates. The input data are provided at the top level of this folder, including two distance arrays and a tab-separated file of debate titles and metadata. They were generated in a separate phase of this project by running topic modeling on the Parliamentary Debates and computing distances between the debates and four seed documents using a variety of divergence metrics. 

Running `jca.py` will write three plots and two text files to the output folder. The text files list debate titles that are the most similar to the seed corpus according to some threshold, set by the user. In this case it is set to 1%.

## REQUIREMENTS
* Python 3.7.0 (Anaconda Distribution)
* os
* numpy 1.15.1
* pandas 0.23.4